Integration of Biological Sources: Exploring the Case of Protein Homology

نویسندگان

  • Tjeerd W. Boerman
  • Maurice van Keulen
  • Paul van der Vet
  • Edouard I. Severing
چکیده

Data integration is a key issue in the domain of bioinformatics, which deals with huge amounts of heterogeneous biological data that grows and changes rapidly. This paper serves as an introduction in the field of bioinformatics and the biological concepts it deals with, and an exploration of the integration problems a bioinformatics scientist faces. We examine ProGMap, an integrated protein homology system used by bioinformatics scientists at Wageningen University, and several use cases related to protein homology. A key issue we identify is the huge manual effort required to unify source databases into a single resource. Uncertain databases are able to contain several possible worlds, and it has been proposed that they can be used to significantly reduce initial integration efforts. We propose several directions for future work where uncertain databases can be applied to bioinformatics, with the goal of furthering the cause of bioinformatics

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Sequencing And Bioinformatics Analysis of the Partial Promoter Region of Κ-Casein (CSN3) Gene in Iranian Bacterianus and Dromedaries Camels

κ-casein is a glycosilated protein belonging to a family of phosphoproteins (αs1,β, αs2,κ) that represents the major protein component in mammalian milk. κ-casein plays an essential role in the case of micelle stabilization, determining the size and the specific biological function. In the present study, we report the characterization of the partial sequence of κ-casein promoter region in the I...

متن کامل

Sequencing And Bioinformatics Analysis of the Partial Promoter Region of Κ-Casein (CSN3) Gene in Iranian Bacterianus and Dromedaries Camels

κ-casein is a glycosilated protein belonging to a family of phosphoproteins (αs1,β, αs2,κ) that represents the major protein component in mammalian milk. κ-casein plays an essential role in the case of micelle stabilization, determining the size and the specific biological function. In the present study, we report the characterization of the partial sequence of κ-casein promoter region in the I...

متن کامل

Computational Identification of Micro RNAs and Their Transcript Target(s) in Field Mustard (Brassica rapa L.)

Background: Micro RNAs (miRNAs) are a pivotal part of non-protein-coding endogenous small RNA molecules that regulate the genes involved in plant growth and development, and respond to biotic and abiotic environmental stresses posttranscriptionally.Objective: In the present study, we report the results of a systemic search for identifi cation of new miRNAs in B. rapa using homology-based ...

متن کامل

Uncertain Groupings: Probabilistic Combination of Grouping Data

Probabilistic approaches for data integration have much potential [7]. We view data integration as an iterative process where data understanding gradually increases as the data scientist continuously refines his view on how to deal with learned intricacies like data conflicts. This paper presents a probabilistic approach for integrating data on groupings. We focus on a bio-informatics use case ...

متن کامل

In Silico and in Vitroinvestigations on cry4aand cry11atoxins of Bacillus thuringiensis var Israelensis

In the present study we attempted to correlate the structure and function of the cry11a (72 kDa) and cry4a (135 kDa) proteins of Bacillus thuringiensis var israelensis. Homology modeling and secondary structure predictions were done to locate most probable regions for finding helices or strands in these proteins. The JPRED (JPRED consensus secondary structure prediction server) secondary struct...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011